CHABAUTY'S METHOD PROVES THAT MOST ODD DEGREE 
HYPERELLIPTIC CURVES HAVE ONLY ONE RATIONAL POINT 
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Abstract. Consider the smooth projective models C of curves y 2 = f(x) with f(x) £ Z[x) 
monic and separable of degree 2g + 1. We prove that for g > 3, a positive fraction of these 
have only one rational point, the point at infinity. We prove a lower bound on this fraction 
that tends to 1 as g — > oo. Finally, we show that C(Q) can be algorithmically computed for 
such a fraction of the curves, via Chabauty's method at the prime 2. 



1. Introduction 

In 1983, Faltings proved that if C is a curve of genus g > 1 over Q, then C(Q) is finite 
[Fal83]. Our goal is to study C(Q) as C varies in a family, namely the family T g of hyper- 
elliptic curves y 2 = f(x) for f(x) G Z[x] monic and separable of degree 2g + 1 for a fixed 
g > 1. Although we write an affine equation, we mean the smooth projective model, which 
has one point oo at infinity since deg / is odd. 

Our main results are: 

• For each g > 3, a positive fraction of the C G T g satisfy C(Q) = {oo} (Theorem 10.3). 

• The fraction tends to 1 exponentially fast as g — > oo (Theorem 10.5). 

• Chabauty's method [Cha41, Col85] at the prime 2 is enough to yield an effective 
algorithm to determine C(Q) for C in a computable subset whose density is positive 
for g > 3 and tends to 1 as g — > oo (Corollary 10.12). 

In particular, most monic integral polynomials of large odd degree never yield a square 
when evaluated on rational numbers. This is the first time that Faltings' theorem has been 
made effective for a positive fraction of a "large" family of curves with a rational point. 
(The presence of a rational point makes it impossible to use local methods to prove that 
C(Q) — 0) an d generally tends to make the determination of C(Q) more difficult.) On the 
other hand, the fraction is conjectured to be 1 for all g > 2 (Remark 10.10). 

Our proofs depend crucially on work of Bhargava and Gross on the average behavior of 2- 
Selmer groups of hyperelliptic Jacobians [BG12, Theorems 37 and 44], which in turn built on 
work of Wang [Wanl2] and earlier work by Bhargava and Shankar for elliptic curves [BS10], 
itself preceded by work of de Jong [dJ02] in the function field case. Bhargava and Gross too 
deduced corollaries for C(Q) from [BG12, Theorem 37]: specifically, they proved that for 
each g > 2, there is a positive fraction of C G T g satisfying #C(Q) < 3, and for each g > 3, 
the fraction of C satisfying #C(Q) < 20 is more than 1/2 [BG12, Corollary 4]. On the other 
hand, our arguments are essentially disjoint from those in [BG12]: we use [BG12, Theorems 
37 and 44] only black box. 

To explain how Bhargava and Gross passed from Selmer group information to information 
on C(Q), and to explain why different ideas are needed to obtain our results, we must recall 



Date: January 29, 2013. 



1 



Chabauty's method (see [MP 10] or [Sto06] for a more detailed exposition). Let C be a 
curve over Q embedded in its Jacobian J. In 1941, Chabauty [Cha41], inspired by an idea 
of Skolem [Sko34], proved a weak form of what is now Faltings' theorem, namely that if 
rk J(Q) < g, then C(Q) is finite. Chabauty's approach was to bound C(Q) by C(Q P ) n J(Q) 
inside J(Q P ), where J(Q) is the p-adic closure of J(Q) in J(Q P ). Later, Coleman [Col85] 
showed how to refine Chabauty's argument to obtain an explicit upper bound on #C(Q), 
and improved bounds were given in [Sto06]. These off-the-shelf bounds at odd primes of good 
reduction were enough for Bhargava and Gross to obtain their results for #C(Q) above. 

But it is impossible to reduce the Chabauty upper bound on #C(Q) to 1 if one knows 
only the size of the 2-Selmer group (when it does not force J(Q) to be finite), because there 
is nothing to control the position of J(Q) in J(Q P ). Restricting the family of curves to a 
subfamily defined by finitely many congruence conditions does not help. 

To solve this problem, we resurrect an idea of McCallum [McC94], that knowledge of the 
p-Selmer group Sel p J and its map to J(Q p )/pJ(Q p ) can sometimes be used to extract a 
tiny bit of information on the position of J(Q) inside J(Q P ). McCallum used this idea to 
study the arithmetic of Fermat curves in 1994, but as far as we know, it has not been used 
since. Fortunately, the method of Bhargava and Gross yields not only the average size of 
Sel 2 J, but also equidistribution of the images of its nonzero elements in J(Q 2 )/2J(Q 2 ) as C 
varies (see Section 8.3 for the meaning of this), even if one imposes finitely many congruence 
conditions on C. We will prove that this suffices for the application of McCallum's idea: we 
impose congruence conditions to control the position of C(Q 2 ) in J(Q 2 ) for C with good 
reduction at 2, and apply equidistribution to prove that for g large enough (at least 3), at 
least a small positive fraction of these C have the 2-adic closure J(Q) in a favorable position, 
i.e., intersecting C(Q 2 ) in only one point, so that a 2-adic Chabauty argument succeeds in 
proving C(Q) = {oo}. 

To carry out the argument in the previous sentence, we introduce a reformulation of 
Chabauty's method in which we compute C(Q 2 ) H J(Q) not in J(Q 2 ) directly but only 
after applying a sequence of maps (see (6.1)). Specifically, we prove that if Sel 2 J — > V := 

— — — - ~~77^: — — 1F 2 is injective and the images of certain partially-defined maps 

2 J (Q 2 ) + J (Q 2 )tors 



and 



plog: C(Q 2 ) J(Q 2 ) 5 Z g 2 — > ¥' J - 1 (Q 2 ) 4 F*- 1 ^ 



Sel 2 J ->■ V ~ Ff — + P^ 1 (F 2 



do not meet, then C(Q 2 ) D J(Q) contains only torsion points of odd order (Proposition 6.3). 
Next, we exclude nontrivial torsion points by proving that most hyperelliptic curves do not 
contain any Q 2 -rational torsion points except for Weierstrass points (Corollary 8.6): this 
follows from a new purely geometric statement, that the generic hyperelliptic curve contains 
no torsion points except Weierstrass points (Theorem 7.1). Because Bhargava and Gross 
tell us how many nonzero Selmer elements there are and how they are distributed in V, it 
remains to show that plog(C(Q 2 )) is not too large. When g > 3, this can be arranged for 
a positive fraction of curves by 2-adic congruence conditions on C since it turns out that 
plog(C(Q 2 )) is locally constant as C varies 2-adically, if we exclude curves with unexpected 
torsion points (Proposition 8.7). 
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Now let us sketch how we strengthen the result to obtain a fraction that tends to 1 as 
g — > oo. We must continue to use the prime 2, because the equidistribution is known only 
for the 2-Selmer group, but now we must also consider C with bad reduction at 2, since the 
density of curves with good reduction at 2 tends to a number strictly less than 1 as g — > oo. 
In fact, we must deal with curves with arbitrarily bad reduction at 2, and our task is to prove 
that plog(C(Q 2 )) is usually small. Since log|c(Q 2 ) is computed by integrating 1-forms on 
residue disks, which correspond to the F2-points in the smooth locus C smooth of the minimal 
proper regular model, it suffices to prove that 

(1) the average size of C smooth (F 2 ) is small, and 

(2) the image of plog on each residue disk is small. 

As for (1), the Ogg-Saito formula [Sai88] together with [Liu94, Proposition 1] bounds the 
number c of connected components of C^™ ooth in terms of the Deligne discriminant of C, but 
this discriminant is hard to compute, and it is not known whether c can be bounded in terms 
of the usual discriminant of the polynomial f(x): the best results in this direction we know 
are those in [Liu96, Section 9]. So instead we analyze the random variable ^C smooth (F 2 ) by 
explicitly blowing up non-regular F 2 -points until we have an approximation to C; this leads 
to a recursive analysis of a Bienayme-Galton- Watson-like process (Lemma 9.5). The result 
(Theorem 9.1) is that the average of #C smooth (F 2 ) is at most 3. 

As for (2), this amounts to bounding the image of an analytic curve in P 9_1 (Q 2 ) under 
the reduction map to P S_1 (F 2 ). We do not know of results in the literature on this kind of 
nonarchimedean analysis problem, though it is reminiscent of tropical geometry. To handle 
it, we apply the p-adic Weierstrass preparation theorem to replace the power series defining 
the analytic curve by polynomials, and hence reduce to the case of an algebraic rational curve. 
We reinterpret the map Pq 2 — > Pq" 1 defining this curve as a rational map P| 2 — » P|~\ whose 
indeterminacy at F 2 -points we resolve, the upshot being that our image can be bounded in 
terms of the complexity of a tree of rational curves over F 2 (see Section 3). 

Many of our arguments work also at primes p other than 2, so we work in this more general 
context when possible. On the other hand, the Bhargava-Gross equidistribution theorem is 
known only for 2-Selmer elements, so the final results for higher p must remain conditional 
for the time being. 

2. Notation 
For any field k, let k be an algebraic closure. 

We fix a prime p. As usual, we define 7L p := |imZ/p n Z and its fraction field Q p := FracZ p . 
More generally for any place v of Q, let Q„ be the completion of Q at v. Let C p be the 
completion of Q p , let Oc p be its valuation ring, and let D\ be the open unit disk in C p . Let 
v p be the p-adic valuation on C p , normalized so that v p (p) = 1. 

We fix g e Z>i. For a field k, let P be the usual map k 9 \ {0} F 9 ~ 1 (k). We write p 

for the reduction map P 5_1 (Q P ) = P 5_1 (Z P ) ->■ P 5_1 (F P ) or for the composition Q 9 p \ {0} A 

P 9_1 (Q P ) 4P 9 ^(F P ). If T is a subset of a set S, and / is a function defined only on T, then 
f(S) means /(T); for example, we may write p(Q p ) = P 9 ~ 1 (F p ). 

A variety is a separated scheme X of finite type over a field, and X is called nice if 
it is smooth, projective, and geometrically integral. If X and T are S'-schemes, define 
Xt '■= X x s T; in this context we sometimes write R instead of SpecR. Given a ring R, 
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and / G R[x] of degree 2g + 1, by the standard compactification of y 2 = f(x) we mean the 
i?-scheme Proj R[x, y, z\/(y 2 — z 2g+2 f(x/z)) where degx = degz = 1 and degy = g + 1; 
it can be covered by two affine patches, one isomorphic to y 2 = f(x) and the other to 
y 2 = x 29+2 f(l/x). For any domain R of characteristic not 2, denote by FgiR) the set of all 
nice genus g curves (over Frac R) arising as the standard compactification of 

y 2 = x 2g+1 + a x x 29 + a 2 x 2g ~ x H h a 2g x + a 2g+ i 

for some ax, a 2 , . . . , a 2g+ x G R. This can be identified with R? 9+1 minus the zero set of the 
discriminant of the polynomial in x in the equation above. We set T g ■= J-* g (Z). 

Essentially following [BG12], for C G T g corresponding to (ax, . . . , a 2g+1 ), define the height 
of C as 

H(C) := max{\ax\, [a^ 2 , \a 2g \^ , \a 2g+1 \ 1/(29+1) } . 

(Actually, [BG12] required ax = 0, but this makes little difference: see Remark 8.11. Also, 
their height is the 2g(2g + l)-th power of what we have written.) We set F g ,x '■= {C G T g : 
H(C) < X}. The density of a subset S C T g is fi(S) := hm^^ #(5 n T g ,x)/#F 9 ,x, if 
the limit exists. Define lower density and upper density by replacing lim by liminf or limsup, 
respectively. If 5* C T C T g and fi(T) > 0, the relative density of S in T is fi(S)/fi(T), if 
fi(S) exists; similarly define relative lower density and relative upper density. If / : T g — > R is 
a function, then we say that / has average a on T g if 

hm — = a. 

X^OO #J-g,X 

We say that the average of / is at most a if the limsup is at most a. Similarly define the 
average of a function on an infinite subset of T g . 

Restrict the normalized Haar measure on Z 2g+1 to obtain a probability measure on J-" g (Z p ). 
Let P(S) be the probability of an event S. For a random variable X defined on .F s (Zp), let 
~EiX denote its average. If X is a random variable defined only on a positive-measure subset 
S of J-* g (Zp), then EX denotes the average of X conditioned on the event S. 

3. Images of curves under reduction 

3.1. Algebraic curves. By the degree of a morphism 0: C — > P 9 " 1 , where C is a nice curve, 
we mean deg <fi*<ff(\). If fx, . . . , f g are single- variable polynomials of degree at most n, not 
all zero, then the rational map (fx fg)'- ^ 1 P 9 " 1 extends to a morphism P 1 — > P^ 1 
of degree at most n. 

Proposition 3.1. Let <\>: P<^ -»■ P^ 1 6e o/ decree n. T/ien ^(^(PHQp))) < np + 1. 



Proof. After iteratively blowing up F p -points on P^ p , we obtain a proper regular Z p -scheme 
S" such that extends to a rational map 0': 5" — » P|~ X defined at all F p -points of S". If we 
also blow up closed points of higher degree, we obtain S" such that <fi extends to a morphism 
0": -> P|~\ as in [Lic68, II.D, Proposition 4.2]. Then P X (Q P ) = S"(Z P ) = S'(Z P ), so 
p(<t>(¥\Q p ))) V = p(ct>'(S>(Z p ))) C 0'(5"(F P ))- 

The construction of S" shows that S 1 ^ is a strict normal crossings divisor whose components 
are copies of P^ meeting at F p -points. Name these components S[, S' 2 , . . . in the order that 
they were produced by the blowing up, starting with S[ being the strict transform of Pp . 
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Since cleg 0^*^(1) = deg 0*^(1) = n, the morphism 0" is non-constant on at most n 
components of , so 0' is non-constant on at most n of the sets S'A¥ P ). Let S = {P} for 

some P G S^Fp), and for i > 1, let E, := Uj = i ^O^p)' ^ e ^ ave #0'(^o) = 1- Incrementing 
i increases #0'(Ej) by if 0' is constant on S , j' +1 (F P ) and by at most p otherwise, since one 
of the p + 1 points of S' i+1 (¥ p ) was already in Ej. Thus #0'(Ej) increases at most n times, 
by at most p each time, starting from 1. Hence #0'(S"(F p )) < np + 1. □ 

Remark 3.2. For each (n,p), the bound in Proposition 3.1 is sharp: for any g > n, define 
by taking : = p^ -1 -^ -1 for 1 < j < n + 1, and : = for n + 1 < j < g; then 

p(0(P 1 (Q p ))) is a chain of n lines in P S_1 (F P ). 

Remark 3.3. The proof of Proposition 3.1 suggests a down-to-earth algorithm for computing 
p(0(P 1 (Q p ))). Namely, one iteratively subdivides P 1 (Q P ) into disks until po0 is constant on 
each. 

3.2. Analytic curves. 

Definition 3.4. Let w = (wi, W2, ■ ■ ■ ,w g ) G C p [£] 9 — {0}. Let Wj tU be the coefficient of t n 

in Wj. Define the Newton polygon NP(w) as the lower convex hull of { (n, v p (wj jn )) : 1 < j ' < g, n > 0}. 

Suppose that the minimum of the ^/-coordinates of the vertices of NP(w) is attained; then 

for the vertices attaining this minimum, let n w and iV w be the minimum and maximum 

x-coordinates (the maximum might not exist). When w consists of a single w, we also write 

NP(iu), n w , N w . 

Remark 3.5. The Newton polygon NP(w) depends only on the Z p -span of the Wi. 

Remark 3.6. Let w G Q p [t] 9 — {0}. Let R be the valuation ring of an unramified extension 
of Q p . If Ai, . . . , X g G R have F p -independent images in R/pR, then 

NP(w) = NP(Aiiwi + • • • + XgW g ). 

Remark 3.7. For w G C p [t] — {0} for which n w is defined, the theory of Newton polygons 
[Kob84, Corollary on p. 106] implies that n w = # (zeros of w on Di); we write #( ) instead 
of } to indicate that we are counting zeros with multiplicity. 

Following [Sto06, Section 6], for n > 0, define 

S(p, n) := max{c? > : v p (n + 1) + d < v p (n + d + 1)} . 

Proposition 3.8. Suppose that w G Q p [t] 9 \ {0} has p-adically bounded coefficients. Let 
t G h v W be such that dt/dt = w. Then #p(£(pZ p )) < p(n w + 1 + S(p, n w )) + 1. 

Proof. Let L(t) := £(pt). The lattice point responsible for the value of Ni, is a vertex of 
NP(Lj) for some j. We ensure that it is a vertex of NP(Lj) for every i by adding Wj to each 
Wi for which this does not yet hold (this has the effect of applying a linear change of variable 
to the codomain P 3_1 (F P ) of p o £, but does not change the size of the image). By the p-adic 
Weierstrass preparation theorem (see [Kob84, p. 105, Theorem 14] and its proof), Li = f\ui 
for some fi G Q p [t] of degree TVl and Ui G 1 +ptZ p [t] converging on 7L p . For r 6 Z p , we have 
Uiij) G 1 + pZ p , so p(L(r)) = p(f(r)). Hence p(£(pZ p )) = p(L(Z p )) = p(f(Z p )), which has 
size at most pNj, + 1 by Proposition 3.1 applied to the morphism P 1 — > ps 1 " 1 defined by f. 
Finally, we prove ./Vl < n w + 1 + 5(p, n w ): by Remark 3.6, we may reduce to the case g = 1, 
with coefficients now in an unramified extension; this case is [Sto06, Proposition 6.3]. □ 
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4. The logarithm map 



Let J be an abelian variety over Q p . Let To J be the tangent space to J at 0. Integrating 
1-forms defines an analytic group homomorphism log: J(Q P ) —> T J ~ Qp 1 whose kernel is 
the torsion subgroup J(Q p )tors- Since log is a local diffeomorphism and J(Q P ) is compact, 
its kernel J(Q p ) t0 rs is finite, and its image will be Z^ for a suitable choice of identification 
T J ~ Q^; this identification corresponds to a choice of basis Ui,...,u g of H°( J, fi 1 ), which 
we now fix. Group theory then implies that J(Q P ) — Z^ x J(Q p ) t0 rs as topological groups. 
Let plog be the composition 

J(Q p )^Z^-^P^(F p ), 

defined on J(Q P ) \J(Q 

5. Image of the curve under the logarithm map 

For this section, let C be a nice curve of genus g > 1 over Q p with a Q p -point oo. Embed 
C in its Jacobian J by sending oo to 0. Define log as in Section 4. Let C — > SpecZ p be the 
minimal proper regular model of C. 

5.1. Image of one residue disk. 

Definition 5.1. A residue disk D C C(C P ) is the preimage of a point P £ C smooth (F p ) under 
C(Cp) = C{0 Cp ) -»■ C(F P ); then let D(Q p ) :=Dn C(Q P ). A uniformizer for D is a regular 
function £ on an open neighborhood of P in C such that £ reduces to a uniformizer at P on 

Every point of C(Q P ) reduces to a point of C smooth (F p ), so C(Q P ) is the disjoint union of 
the open sets D(Q P ). 

Let D be a residue disk with uniformizer £. Then £ defines a diffeomorphism D — > D\ 
identifying D(Q p ) with pZ p . The restriction of any u £ H (C, Q 1 ) to D corresponds to an 
analytic 1-form w(t) dt on D\, for some w £ Q p [£] with bounded coefficients. Applying this 
to U)i\c defines some Wj. Let w := [w\, . . . , w g ) and n D := n w . 

Proposition 5.2. We /iave #plog(D(Q p )) < p(nn + 1 + <5(p, n D )) + 1. 

Proof. Since log is defined by integrating . . . ,u g ), the composition 

is some £ £ Q p {t} 9 with d£/dt = w. Then 

#plog(£>(Q p )) = #p(*(pZ p )) < p(n D + 1 + 5(p, n D )) + 1 
by Proposition 3.8. □ 

5.2. Image of many residue disks. 

Lemma 5.3. Let T> be the set of residue disks on C . Then ^De© 7 ^ — 2# — 2. 
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Proof. Let Ai, . . . , X g be as in Remark 3.6. Let u = Yli=i K^i\c £ H°(C Cj) , fi 1 ). On a residue 
disk D with uniformizer t, express u as w(t) dt, so w = Yli=i ^% w i £ C p [£]; then 

n D = = n w = # (zeros of w on £>i) = # (zeros of w on D) , 

by Remark 3.7. Thus 

no < # (zeros of a; on C(C P )) = 2g — 2. □ 



r>e£> 

Proposition 5.4. Lei d := #C smooth (F p ). T/ien 

(5d + 6#-6, if p = 2, 

#plog(C(Q p ))< MU , M P 2 -P (9 , -r ^ 9 

(p + l)d+ -(20-2), rfp>2. 

I p - 2 

Proof. Sum the bound of Proposition 5.2 over all -D and use Lemma 5.3 to obtain 
#p log(C(Q p )) < p(2g -2) + (p+l)d + pY^ 6(p, n D ) . 

D 

If p = 2, use the bound 5(2, n) < 1 + n/2 (and use Lemma 5.3 again) to conclude. If p > 2, 
then J^d ^(Pi u d) < A p (d, 2g — 2), where 

d d 

A p (d,N) := max £(p, rij) : rij G Z> and ^^Uj < iVj-; 

j'=i j"=i 
use the bound A p (d, iV) < N/(p- 2) of [Sto06, Lemma 6.2]). □ 

Remark 5.5. When C is hyperelliptic, with hyperelliptic involution t, one can exploit the 
identity plog(t(P)) = plog(P) to cut the bound of Proposition 5.4 approximately in half, 
after doing some extra work to handle t-invariant residue disks. 

6. Image of the rational points under the logarithm map 

From now on, let C G T g , and embed C in its Jacobian J using oo. Taking Galois 

cohomology of — > J[p] — > J -4- J — > over Q and over Q v for all places v of Q yields the 
rows in the following commutative diagram: 



H^Q, J[p]) 



The p-Selmer group of J is defined by 

Sel p J := { £ G H X (Q, J[p]) : res(£) G im(<5') }. 

In particular, res restricts to a homomorphism Sel p J — > J(Qp)/pJ(Q p ). Since the p-adic 
closure J(Q) contains a finite-index closed subgroup that is a free Z p -module of finite rank, 



J(Q)/pJ(Q) -> J(Q)/pJ(Q) is surjective. Choose log: J(Q P ) -» Z£ as in Section 4. Then 
we have a diagram 



(6.1) 




in which a and Pa are defined as the compositions, so that the diagram commutes on elements 
for which the maps are defined. 

Lemma 6.2. If a is injective, then J(<Q>)[p°°] = and plog(J(Q)) C Pcr(Sel p J). 

Proof. The group J(Q) tors is killed by log, so it maps to in F^. If a is injective, then a 
diagram chase in (6.1) shows that J(Q) tors must map to also in J{Q)/pJ(Q). On the other 
hand, J(Q) tors is contained in the finite group J(Q p )tors, so it is finite. The previous two 
sentences imply J(Q)[p°°] = 0. 

Since p is locally constant on the open set Z£ \ {0}, we have plog(J(Q)) = plog(J(Q)). 
Given non-torsion P £ J(Q), write P = p n Q with Q £ J(Q) \pJ(Q) and n £ Z> ; then 
(6.1) yields plog(P) = plog(Q) = P<r(5(Q)). □ 

Let J(Q p )\p'] be the set of points of finite order prime to p in J(Q p ) t0 rs- 

Proposition 6.3. If a is injective and the images plog(C(Q p )) and P<r(Sel p J) are disjoint, 
thenC(Q p )nl(^j C J(Q p )[p']. 

Proof. Under the hypotheses, the second part of Lemma 6.2 implies that C(Q P ) and J(Q) 
do not meet inside the domain of plog. In other words, C(Q P ) fl J(Q) C J(Q p ) t ors- The first 
part of Lemma 6.2 shows that all elements of J(Q) tors have order prime to p. □ 

7. Torsion points on a generic hyperelliptic curve 

Theorem 7.1, which we hope is of independent interest, shows that the only torsion points 
lying on a generic hyperelliptic curve are its Weierstrass points. 

Theorem 7.1. Let C be a generic hyperelliptic curve of genus g > 1 over a field k of 
characteristic 0; i.e., the image of the corresponding morphism from Spec k to the moduli 
space over Q is the generic point. Assume that C has a k-rational Weierstrass point, which 
is used to embed C in its Jacobian J. Then C{k) D J(k) tOTS consists of only the Weierstrass 
points. 
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Before giving the proof in detail, let us explain the strategy. If P is a torsion point of order 
n on C, then so are all its Galois conjugates. Because of "big monodromy" , there are many 
such Galois conjugates. Taking combinations of these yields a principal divisor associated 
to a rational function of low degree on C. Such functions are fixed by the hyperelliptic 
involution, and this will force P to be a Weierstrass point. 

Proof. We may assume that C is the curve y 2 = Yld^i i x ~ a «) over ^ := Q( a i; • • • > a 2 g +i), 
where the Oj are indeterminates, and that C is embedded in J using the Weierstrass point oo. 
By [A'C79, Theorem 1], the geometric monodromy group contains ker(Sp 2g (Z) — > Sp 29 (Z/2Z)) 
For n > 1, let I n be the image of the geometric monodromy group in GL 29 (Z/nZ). Suppose 
that n = 2 e m where e G Z> and m is odd. Strong approximation for Sp 2g (Z) shows that 
In — h £ x Inn where I 2 e contains diagonal matrices mapping the first standard basis element 
of (Z/2 e Z) 29 to any odd multiple, and I m contains Sp 29 (Z/mZ), which acts transitively on 
points of exact order m. 

Now suppose that P G C(k) is a torsion point of exact order n. Then P is the first vector 
in a symplectic Z/nZ-basis of J[n], say P,Q, . . .. Decompose P as P 2 e + Pn where P 2 e G </[2 6 ] 
and P m G J[m]. Decompose Q similarly as Q 2 e + Qm- Then we may find elements of /„ 
mapping P = P 2 e + P m to Pi := P 2 e + <5m, P2 := p2 e + 2P m , and P 3 := P 2 e + (P m + Q m ). 
Then P + P 3 — R x — R 2 , viewed as a divisor on C^, is principal. If m > 3, then P, Ri, R 2 , R3 
are all distinct, so the corresponding rational function is of degree 2, which implies that its 
divisor is fixed by the hyperelliptic involution, a contradiction. Thus m = 1, so n = 2 e . If 
e > 3, then we may find elements of I n mapping P to Si := (2 e_2 + 1)P, S 2 '■= (2 • 2 e_2 + 1)P, 
and S3 := (3 • 2 e ~ 2 + 1)P, and then the divisor P + S\ — S 2 — S3 yields a contradiction as 
before. Thus e < 2, so n < 4. 

Now nP — noo is the divisor of some h G k[x] + k[x]y. The valuation v at 00 satisfies 
= —2 and t> (|/) = — (2g + 1) < —4, so /i G Thus the hyperelliptic involution fixes 
/;., so it fixes nP — noo, so it fixes P. In other words, P is a Weierstrass point. □ 

Remark 7.2. Our application to 2-adic Chabauty needs to consider only torsion points P of 
odd order n in Theorem 7.1. There is a simpler proof in this special case: the point Q := 2P 
is a Galois conjugate of P, and the divisor Q + 00 — 2P is principal. 

8. Families of curves and Jacobians 

8.1. Abelian scheme over a p-adic variety. Let M be a smooth variety over Q p . Let 
J? — > M be an abelian scheme. Given m G M(Q P ), let be the fiber, an abelian variety 
over Q p . If ^^r/ M is a free iff ^-module with basis u±, . . . ,u g , then fiberwise integration of 
these 1-forms defines a local diffeomorphism Log: J? (Q p ) — > Q p x M(Q p ) that is fiberwise 
the homomorphism log of Section 4. 

Definition 8.1. For an open subset U C M(Q P ), let ^(Q p )(7 be its inverse image in ^(Q p ). 
By a trivialization of ^(Qp) — > M(Q P ) above £/, we mean a diffeomorphism 

r: /(Q p )^(Z»xF)xf/ 

over U, for some finite abelian group F, such that 

• for each m G U, the fiber r m : ^m(Qp) ^ x f is an isomorphism of p-adic Lie 
groups, and 
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• after replacing M by a Zariski open subset whose set of Q p -points still contains U, 
the composition of r with the projection to 7A x U agrees with a map Log defined 
as above. 

Proposition 8.2. The base M(Q P ) can be covered by open subsets U above which (Q p ) — > 
M(Q P ) can be trivialized. 

Proof. Since J? — > M is an abelian scheme, Q 1 ^ j M is trivialized on the preimages of some 
Zariski open sets covering M. Thus we may reduce to the case that Q^/m * s ^ ree - Choose 
a basis to define Log. 

Let m e M(Q P ). Change the basis above so that Log( c / mo (Q p )) = 1P p . Since ^{Qp) -> 
M(Q P ) is a proper and open map, for each compact open subgroup H < Q 9 , the locus of 
m G M(Q P ) such that Log( c ^ m (Q p )) C H is open and closed. The same is true for the 
locus where Log( cJ / m (Q p )) equals H, because a subgroup of equals H if and only if it is 
contained in H and not contained in any of the finitely many maximal subgroups of H. In 
particular, we can find an open neighborhood U of m,Q in M(Q P ) such that Log(^f (Q p )u) = 
IP x U. Since Log is a local diffeomorphism, after shrinking U, we can find analytic sections 
Si, . . . , s g of J?(Q p )u U such that Log(sj) = {e^} x U for each i, with ej the standard 
basis vector. Each group ^ m (Q p ) factors canonically as a finitely generated Z p -module and a 
prime-to-p finite group; if we replace each Sj by its projection onto the Z p -module, fiberwise, 
then we may define a fiberwise Z p -module homomorphism <fi: x U — > (y f{Q p )u sending 

to Si. 

The map from the torsion locus ^(QpVrs = Log _1 ({0} x M(Q P )) to M(Q P ) is proper 
(since ^(Q p ) — >■ M (Q p ) is proper) and a local diffeomorphism (since Log is), so it is a locally 
constant family of finite abelian groups over M(Q p ). Shrink U so that this family is constant, 
say equal to F, above U. Thus we obtain a fiberwise homomorphism i/j: F xU —> a fl(Q p )u- 
The product of </> and ip (over U) is a fiberwise isomorphism (7A x F) xU —> cy f{Q p )u whose 
inverse is a trivialization r above U. □ 

8.2. The universal family of hyperelliptic curves. Let M. be the moduli space over Q 
such that M.(k) = F g (k) for each field extension fc/Q; more precisely, M. is the complement 
of the discriminant locus A = in A 2g+1 . Then F g (Z p ) = Z 2 p 9+1 n Al(Qp). Endow J* s (Z p ) 
with the Haar measure from l? p 9+x . By a congruence class in J-" 9 (Z P ), we mean a coset U of 
(p e Z p ) 2s+1 in with U C F g (7L p \, for such [/, the density of D t/ equals the measure 

of [/. 

Let 7r : — > Ai be the universal curve; 7r is a smooth proper morphism whose fibers are 
nice hyperelliptic curves of genus g. Its relative Jacobian is an abelian scheme J? — > M. 
[BLR90, p. 260, Proposition 4]. Call a congruence class U C F g (JL p ) trivializing if ^(Q p ) — > 
_M(Q P ) is trivialized above U. 

Lemma 8.3. Any open subset o/J r 9 (Z p ) is a disjoint union of trivializing congruence classes. 

Proof. The congruence classes in J-" 9 (Z P ) form a basis for its topology. By Proposition 8.2, 
the same is true for the trivializing congruence classes. Thus any open subset V of J-" 3 (Z P ) 
is a union of trivializing congruence classes C/j. If two congruence classes meet, then one 
contains the other. Thus V is the disjoint union of the Ui not contained in a larger one. □ 

Proposition 8.4. The density of C 6 T a such that J(Q)tors ^ is zero. 
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Proof. By Lemma 8.3, it suffices to prove the result for the C G T g belonging to one trivializ- 
ing congruence class U. For such C, the size of J(Q p )tors is constant, say n. The monodromy 
action does not fix any nonzero torsion point on the geometric generic fiber of J? — > M., so 
the Hilbert irreducibility theorem shows that the density of such C such that J(Q) has a 
nonzero point of order dividing n is zero. □ 

Using the section oo : M. — > *«f , we identify with a closed subscheme of Jf . Let W ^ c io 
be the locus of Weierstrass points. Then ~W is Zariski open and closed in [2] . Let Z be 
the set of m G F g (l* p ) such that ^ m (Q p ) fl (3 / m (Q p )tors is not contained in W m (Q p ). 

Proposition 8.5. The set Z is closed in TgiTLy) and is of measure zero. 

Proof. By Lemma 8.3 applied to J-" ff (Z p ), we may restrict attention to a trivializing congru- 
ence class U. Let n be the constant value of # ^m(Q P )tors for rn £ U. Let Z' be the image of 
the Qp-points under the restriction n n : n {/[n] - W) ->• .M of vr. Then Z n C/ = Z' n C/. 
Since ^ — > M. is proper and ^ [n] \ W — > Ai is finite etale, the morphism 7r n is proper, so 
Z' is closed. By Theorem 7.1, 7r n is not dominant, so Z' is of measure zero. □ 

Corollary 8.6. The set of C G T g such that C(Q P ) fl J(Q p ) t0 rs contains a non- Weierstrass 
point is of density zero. 

Proof. Apply Lemma 8.3 to J-* g (Z p ) \ Z to show that the complement has density 1. □ 

Proposition 8.7. Let U be a trivializing congruence class. Let Z be as in Proposition 8.5. 
Let U' := U \ Z . Then plog(^ m (Q p )) in P 3_1 (F P ) is locally constant as m varies in U' . 

Proof. We have analytic maps of p-adic manifolds 

(8.8) V(Q p )ui — ► ^Z^xU'^Z 9 P^(Q P ) -A ^(Fp), 

except that the dashed arrow is indeterminate at 0. We resolve the indeterminacy by blowing 
up the first four manifolds along the inverse image of G Zp\ The inverse image in *Jf (Q p )[// 
is the torsion locus, which by definition of U' equals only W{Q, p )u'i which is a smooth 
divisor on ^(Q p )u'- Thus the blow-up of c £'(Qp)u' is ( &(Q P )u' itself, so (8.8) extends to a 
continuous map e: ^(Q p );7' — ► P 9_1 (F P ). The fibers of e are open and closed. So are are 
their images in U' since ^ — >■ .M is smooth and proper. The locus in U' where e(^ m (Q p )) 
equals a given subset of P 9_1 (F p ) is a finite Boolean combination of these images, hence 
again open and closed. Thus e(^ m (Q p )) is locally constant as m varies in U' . Finally, plog 
is just the restriction of e to a dense open subset of ^ m (Q p ), and e is locally constant, so 
plog(^ m (Q p )) = e(^ m (Q p )). □ 

Remark 8.9. The set plog( < ^' m (Q p )) is generally not locally constant in a neighborhood of 
points of Z. 

8.3. Equidistribution of Selmer elements. Let U be a trivializing congruence class. If 
m G TgdU, the trivialization lets us construct the diagram (6.1) for C := ^ m and J : = 
in particular, we obtain a: Sel p J — » F£. 

For each g > 1 and prime p, we may now formulate the following equidistribution conjec- 
ture, compatible with both the heuristics in [PR12] and the theorems for p = 2 in [BG12]. 

Conjecture Eq g (p). For any trivializing congruence class U and any w G Fp", the average 

size of {s G Sel p J \ {0} : a(s) = w} as C varies in T g D U is p x ~ 9 . 
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Theorem 8.10. For each g > 1, Eq g (2) holds. 

Proof. By [BG12, Theorem 37] (adapted as in Remark 8.11 below), the average size of 
Sel2 J \ {0} for C E JF g D U is 2. The trivialization identifies each group J(Q 2 ) with a 
fixed group Tj 9 2 x F, and [BG12, Theorem 44] states that the images of the nonzero Selmer 
elements under Sel 2 J -)> J(Q 2 )/2J(Q 2 ) ~ F^ x F/2F are equidistributed in Fg x F/2F, so 
their images under the projection to ¥ 2 are equidistributed too. Thus on average there are 
2/4f$2 = 2 1-9 nonzero Selmer elements mapping to a given element of F|. □ 

Remark 8.11. Let J 7 ' be the subset of T a where a\ = 0, and let T'' be the subset of J 7 ' where 

y " y y 

there is no prime p with p 2m \a m for all m. The paper [BG12] works not with T g , but with 
T". Here we explain how to transfer the equidistribution results from J 7 '' to T a . 

i/ y " 

First, given subsets 

^oo := [63, &' 2 ] X • • • X [6 2g+1 , &' 2s+1 ] C IR 29 , 

and _B p a coset of a finite-index subgroup of l? v 9 for p in some finite set S, call B : = 
-Boo x ripes B p a box. For X > 0, consider the curves in T' g whose coefficient tuple satisfies 
[a 2 /X 2 , . . . , a 2g +i/X 29+1 ] G and (a 2 ,...,a 2fl ) G B p for all p E S. The arguments of 
[BG12] carry over essentially without change to prove equidistribution of Selmer elements 
for such curves as X — > 00. By taking finite linear combinations, one obtains a variant that 
counts the same curves but with a weight that is a step function that is finitely piecewise 
constant on sub-boxes of B. Then one can do the same for any bounded weight function w 
on B that for every e > can be bounded above and below by step functions both within e 
of w off of sub-boxes of total measure less than e. 

Let S be a finite set of primes including those dividing 2g + l. The "complete the (2g + l) st 
power and scale" map sending f(x) to (2g + l) 2g+l f((x — a\)/{2g + 1)) defines a map on 
coefficient tuples from [— 1, 1] 29+1 x Yl p£ s'^p 9+l some box B, and the pushforward of 
the uniform measure is a weight function w as above. The corresponding map from T 9r x_ to 
jF' g has fibers whose size is approximately proportional to w at the corresponding / G JJ, 
normalized, so the limit of the average for T g equals the limit of the weighted average for T' g . 
(One can also impose finitely many congruence conditions by replacing each 7* 29+1 by a coset 
of a finite-index subgroup.) 

Remark 8.12. Although Eq g (p) is about p-adic equidistribution of p-Selmer elements, one 
could also ask about f-adic equidistribution of p-Selmer elements for any place v of Q. In 
fact, [BG12, Theorem 44] proves u-adic equidistribution of 2-Selmer elements for all v, and 
can even handle finitely many v simultaneously. 

8.4. A general density result. 

Proposition 8.13. LetU be a trivializing congruence class such that the subset plog( < ^ m (Q p )) C 
P 3_1 (F P ) is constant for m E U, say equal to I . Then, for C E T g D U outside a subset of 
relative upper density at most (1 + ^/)j9 1 ~ 5 ', we have: 

(i) ifp = 2, then C(Q) = C(Q 2 ) D J(Q) = {00}. 

(ii) if p > 2 and Eq g (p) holds, thenC(Q) = {00} and C(Q P ) fl J(Q) consists of Weierstrass 
points. 
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Proof. By Theorem 8.10, Eq 9 (2) holds; if p > 2, assume Eq (p). Then for C G T g D C/, the 
average number of nonzero elements of Sel p J mapped by a to is p 1 " 9 , so the relative upper 
density of C having such a Selmer element is at most p l ~ 9 . Similarly, the average number of 
nonzero elements of Sel p J mapped by Pa into I is (p — Vj^Ip 1 ^ 9 , but each curve with such 
an element has at least p — 1 such elements (its nonzero multiples), so the relative upper 
density of such curves is at most #/p 1-9 . Together, these have relative upper density at most 
(1 + #/)p 1 " 9 , and this is unchanged if we include the density zero sets of Proposition 8.4 
and Corollary 8.6. 

For the other C G FgOU, Proposition 6.3 states that C(Q P ) n J(Q) C J(Q p )[p']. Since 
we excluded the set of Corollary 8.6, C(Q P ) n J(Q) consists of Weierstrass points. For 
p = 2, these together imply C(Q P ) PI J(Q) = {oo}. For all p, our exclusion of the set of 
Proposition 8.4 implies C(Q) = {oo}. □ 

Remark 8.14. By Lemma 8.3 and Proposition 8.7, there is a disjoint union of sets U satisfying 
the hypothesis of Proposition 8.13 and having total measure 1. 

Remark 8.15. We expect that for p > 2, there is a positive density of C G T g with C(Q P ) n 
J(Q) strictly larger than {oo}. The reason is that we expect that there is a positive density 
of C such that C has good reduction at p, there is a Weierstrass point W G C(Q P ) \ C(Q), 
and there is a point P G J(Q) with the same reduction as W. For such curves, p n P — > W 
as n — )■ oo, so W £ C(Q P ) n J(Q). 

9. Average number of residue disks 

In this section, we fix g > 1 and a prime p. For a random C G ^(Zp), let C be its minimal 
proper regular model. The main goal of this section is the following. 

Theorem 9.1. We have E#C smooth (F p ) < p + 1. 

Because it is difficult to construct C explicitly, we construct a model with a weaker prop- 
erty. Call a Z p - model D of C decent if T> is proper over 7L P and the image of C(Q P ) = 
V(Z p ) -> D(F p ) is contained in £> smooth (F p ). 

Lemma 9.2. IfV is decent, then #C smooth (F p ) < #£> smooth (F p ). 

Proof. Let it: £ — > T> be the minimal desingularization of V; this is an isomorphism above 
£,smoot\ Also, if e G £ smooth (F p ), then e is the reduction of a point in C(Q P ) by Hensel's 
lemma, and V is decent, so vr(e) G £> smooth (F p ). Thus vr defines a bijection £ smooth (F p ) -> 
P smooth (F p ). On the other hand, £ — > C factors as a sequence of blow-ups at closed points 
[Lic68, II. A, Theorem 1.15], and each blow-up morphism is surjective on F p -points. □ 

By Lemma 9.2, to prove Theorem 9.1 it suffices to construct a decent model T> of each 
C G ^(Zp) and to prove E#D smooth (F p ) < p + 1. We use the following recursive algorithm 
to construct T>. 

Algorithm MakeDecentModel(C). 

Input: A curve C : y 2 = f(x) in ^(Zp). 

1. Let T> be the standard compactification of y 2 = f(x) over Zp. 

2. Let U be the closed subscheme y 2 = f(x) of A| with its open immersion into T>. 
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3. Modify T> by running Fix({7) below. 



Subroutine Fix(C7). 

Input: A closed subscheme U : y 2 = h(x) of A| with an open immersion into T>. 

1. Replace V by its blow-up at the set of non-regular F p -points of U. 

2. For each c6 {0,1,... ,p — l}, if p\h'(c) and p 2 \h(c), then let U c be y 2 = p~ 2 h(c+px), which 
is an affine patch of the blown- up T>, and run Fix(U c ). (These processes for different c 
may be run independently without interference, since the special fibers of the U c have 
disjoint images in D.) 

Lemma 9.3. Algorithm MakeDecen.tModel(C) terminates and yields a decent model V ofC. 

Proof. If the recursion reaches nesting depth n (where the initial call to Fix([7) is nesting 
depth 0), then the composition of the changes of variable x i— > c+px is of the form x t— > d+p n x 
for some (I G {0,1,...,/ - 1} such that p~ 2n f(d + p n x) G Z p [x], and hence f(d) G p 2n Z p 
and f'(d) G p nr L v . Thus if the nesting is unbounded, compactness yields d G Z p such that 
f(d) = f'(d) = 0, contradicting the definition of J-* g (Z p ). Hence the algorithm terminates. 

Suppose that P G C(Q P ). If x(P) ^ Z p , then P reduces to the smooth point oo on the 
special fiber of T>. Otherwise, P belongs to U (Z p ) for the initial U. Consider the last time 
Fix({7) is called with a U such that P G U (Z p ). Without loss of generality, make a change of 
variables x \— > x + c to assume that P reduces to a point in {7 (F p ) with x — 0. Let a = /i(0) 
and 6 = fe'(0). 

(1) Suppose p { b. Then C7 is smooth at the F p -points with x = (the x-derivative is 
nonzero) . 

(2) Suppose p\b. 

(a) Suppose p\ a. 

(i) Suppose p 7^ 2. Then {7 is smooth at the F p -points with x = (the 
y-derivative is nonzero). 

(ii) Suppose p = 2. Then [7 is isomorphic to + 2y = (a — 1) + bx H — • , and 
has a unique F 2 -point u with x = 0. 

(A) If a = 3 (mod 4), then [/ is regular but not smooth at u, so P could 
not have existed. 

(B) If a = 1 (mod 4), then [/ is not regular at it, so u was blown up in 
Step 1 of Fix(C7); then P corresponds to a Z2-point of the affine patch 
U' : y 2 + y = a' + b'x + ■ • • of the blow-up obtained by making the 
change of variable (x, y) t— > (2x, 2y) and dividing by 2 2 ; this entire 
patch is smooth (the y-derivative is nonvanishing) . 

(b) Suppose p\a but p 2 \ a. Then U is regular but not smooth at the unique F p -point 
with x = (the origin), so P could not have existed. 

(c) Suppose p 2 \a. Then in Step 2 of Fix(U), we would have called Fix({7 ) f° r U 
such that P G C/ (Z p ), contradicting the assumption on U . □ 

Construct V by algorithm MakeDecentModel. For n > 0, let K n be the set of polynomials 

f( X ):= p (29-l)n x 2g+l + p (2 9 -2)n a2sa ,2 9 + . . . + p n a33; 3 + fl2X 2 + flia . + ^ 

with Oo, . . . , a2g G Z p , and let K n be the (full measure) subset with nonzero discriminant. 
Identify each / G % n with the standard compactification of y 2 = f(x) over Q p ; for example, 
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H ~ TgC^p). Let S n := {h G "H n : p\ai andp 2 |ao}; define S n similarly. The bijection 
S n — > H n+ i sending h(x) to p~ 2 h(px) respects addition, so it respects Haar measure (up to 
normalization). Thus, by induction on n, inside a call to subroutine Fix(U) at nesting depth 
n arising from a sequence of choices c±, . . . , c n in Step 2 of earlier calls, the distribution of 
h is uniform over "H n . Let X n be the random variable on "H n that counts the number of 
smooth Fp-points of the final T> lying above F p -points in this U having x — 0; if we replaced 
by any other c G {0, 1, . . . ,p — 1}, the distribution of values would be the same. 

Lemma 9.4. For n > 0, the restriction X n \$ n is the sum of p random variables, each of 
which has the same distribution of values on S n as X n+ \ has on H„+i . 

Proof. For h G S n , Step 2 of F±x(U) leads to U' : y 2 = p~ 2 h(px), and the points of X? smooth (F p ) 
lying above points in U with x = are all those lying above U'. The number of these whose 
image in U' has a particular x-coordinate in F p is distributed like X n+ i on % n+ \. □ 

Lemma 9.5. For n > 0, we have EX n = 1. 

Proof. We divide H n into subsets corresponding to the cases in the proof of Lemma 9.3. For 
each case, we compute its probability, and the average contribution to X n conditioned on 
being in that case: 



Case 


Probability 


Average contribution to X n 


(1) 


1 -p~ l 


1 


(2) (a) 


— 1 —2 
P — P 


1 


(2)(b) 


— 2 —3 
P — p 





(2)(c) 


p-3 


sum of p copies of X n+ i. 



Let us explain the entries in the last column. In case (1), the smooth F p -points of U with 
x = correspond to square roots of a uniformly random element of F p ; the expected number 
of square roots is 1. In case (2) (a) for p ^ 2, the contribution is the expected number of 
square roots of a random element of F*, which is again 1. In case (2) (a) for p = 2, in 
subcase (A) the contribution is (we are counting smooth F p -points), while in subcase (B) 
the average contribution is 2 since each point of A 2 (F P ) has a 1/2 chance of lying in U'(¥ p ); 
thus the overall average contribution in case (2) (a) for p = 2 is again 1. In case (2)(b), 
the contribution is 0. Case (2)(c) corresponds to S n , so we use Lemma 9.4. Moreover, the 
analysis shows that X n < 4 outside of case (2)(c). 

The upshot is that X n is given by a process whose parameters are independent of n; 
it would be a Bienayme-Galton-Watson process if the p random variables in Lemma 9.4 
were independent (they are generally not). For real B > 4, the only way that X n > B 
can hold is if we are in case (2)(c) and one of the p copies of X n+ \ exceeds B/p; thus 
P(A"„ > B) < p~ 3 pP(X n+ i > B/p). Iterate this k times, where k is the first integer with 
B/p k < 4, to obtain 

P(X n >B) < p~ 2k T>(X n+k >B/p k ) < p~ 2k = 0(B- 2 ). 

Thus the average EX n = Y^b=i > B) is finite and bounded independently of n. The 

table implies that 

EX n = (1 - p- 1 ) ■ 1 + (p- 1 - p~ 2 ) ■ 1 + (p" 2 - p~ 3 ) .0 +p - 3 (p. EX n+1 ). 

By induction on k, we obtain EA n = 1 — p~ 2k + p~ 2k ■ EA n+ fc. Taking the limit as k — » oo 
yields EX n = 1. □ 
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Lemma 9.6. We have E#£> smooth (F p ) =p + l. 

Proof. First, T> has the smooth F p -point oo on the standard compactification. The other 
points of £> smooth (F p ) lie above the initial U. The average number of these lying above F p - 
points in U with x = c is independent of c G F p , so it equals its value for c = 0, which by 
definition is EX , which is 1 by Lemma 9.5. Thus E#£> smooth (F p ) = 1 + J2 C & P 1 = P+ D 

This completes the proof of Theorem 9.1. 

Remark 9.7. One can show that there is a positive probability that the morphism £ — > C in 
the proof of Lemma 9.2 involves blowing up a smooth F p -point; thus E#C smooth (F p ) < p+1. 

Remark 9.8. On the other hand, at least for p > 2, one can show that a random C G T g with 
good reduction has E^C smooth (F p ) = p+1. (This is because all such C arise from a curve in 
T g with discriminant in Z* by a substitution x d + p n x for a uniquely determined n > 
and d G {0, 1, . . . ,p n — 1}, and the expected value of C(F p ) for C G J-* s (F p ) is p + 1, as one 
sees by grouping each C with its quadratic twist.) Thus one has the counterintuitive fact 
that on average, curves with bad reduction have fewer smooth F p -points than curves with 
good reduction! 

Remark 9.9. The argument proving Theorem 9.1 proves E#C smooth (F p ) < p + 1 also for the 
random nice curve over Q p given by 

V 2 = a 2g +2X 29+2 H h a 

for ao, • • • , a 2g +2 G Z, p such that the discriminant is nonzero. 

Corollary 9.10. For C G ^(Zp), the average size of plog(C(Q p )) is at most 

'6# + 9, if p = 2, 

P -Z^(2g - 2) + (p + l)\ ifp>2. 
p — 2 

Proof. Combine Proposition 5.4 and Theorem 9.1. □ 

Remark 9.11. Using the refinement mentioned in Remark 5.5, one can improve this to 3g+9/2 
for p = 2 and 

p 2 -p (P+1) 2 
J^2 i9 ~ 1) + ^2— 

for odd p. 

10. The main results 
10.1. Positive density for g > 3. 

Lemma 10.1. Let J be an abelian scheme over"L p . Define log: J~{Q P ) — > Q p by integrating 
a Z p -basis ofR^J,^^). If J(¥ p )[p] = and J(Q p )[p] = 0, then log{J{Q p f) = (pZ p ) 9 . 

Proof. Let F be the formal group of J over Z p . For e > 1, define K e := F((p e Z p ) 9 ) = 
ker( l 7(Z p ) j7"(Z/p e Z)). Since i\i is a pro-p-group and j7"(Q p )[p] = 0, we have (i\i) t0 rs = 0. 
Thus log |xi is injective. Hence for e > 2, 

(log K e _ x : \ogK e ) = (K e ^ : K e ) = ((p^Zp) 9 : (p e Z p ) 9 ) = p 9 . 
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Also, plog-Ke-i C logK e . On the other hand, logK e is a compact open subgroup of Q 9 , 
and hence a free Z p -module of rank g. The previous three sentences show that log-fT e _i = 
p" 1 log K e . 

We prove that log-K^ = (p e Z p ) 9 for all e > 1, by reverse induction on e. For large e, we 

have log K e = (p e Z p ) 9 since the derivative of the composition (pZ p ) 5 — > <J(Q P ) — ^ Q p at 
is invertible over Z p . The previous paragraph lets us pass from e to e — 1. 

In particular, logi^i = (pZ p ) 9 . Since (J7"(Q P ) : K\) = #J7"(F P ), which is prime to p, we 
have log(J r (Q p )) = {pZ p ) 9 as well. □ 

Lemma 10.2. For each g > 1, there exists C G F g \ Z such that #plog(C(Q 2 )) = 1. 

Proof. Let C G T g be a curve isomorphic to ?/ 2 + y = x 29+1 + x + 1. Completing the square 
yields a new equation for C of the form y 2 = f(x). The 2-adic Newton polygon of / is a 
single line segment from (0, —2) to (2g + 1,0), which has no interior lattice points, so / is 
irreducible over Q 2 . Elements of J(Q 2 )[2] correspond to partitions of the set of zeros of / 
into two parts, but only the trivial partition is Gal(Q 2 /Q 2 )-invariant, so J(Q 2 )[2] = 0. 

The curve C has good reduction at 2. Let C and J be the smooth proper models of C 
and J over Z 2 . The hyperelliptic involution of Cf 2 is (x, y) i— > (x, y + 1), which fixes only 
oo; this implies that J r (F 2 )[2] = 0, so J r (F 2 )[2] = 0. Lemma 10.1 implies that a basis of 
H (.7,fi^ /Z2 ) defines log such that log(J(Q 2 )) = (2Z 2 ) 9 . We may divide by 2 if desired to 
make log(J(Q 2 )) = Z 2 , but this will not change the map plog. 

The change of variable s := 1/x and t ■= y/x 9+1 rewrites the equation of C as t 2 + s 9+1 t = 
s 2g+2 _|_ s 2g+i _|_ s ^ corresponding to (0,0) in the new model. Let U\ := dt/(—(g + 

l)s 9 t + (2g + 2)s 29+1 + (2g + l)s 29 + 1), the denominator being a partial derivative of the 
curve equation. Then tOj := s^ l u\ for j = 1, . . . , g form a basis for H°(C, f2^ Za ). We have 
C(F 2 ) = {oo}, so there is just one residue disk, and t is a uniformizer for it. Expanding in 

power series in t, we find s = t 2 + t 2g+ ' 3 + ■ ■ ■ , U\ = (1 + Ot H ) dt, and Uj = (t 2j ~ 2 H ) dt 

for j = 1, . . . , g, with all coefficients in Z 2 . Integrating yields 

£(t) = (t + 0t 2 + -- - ,t 3 /3 + --- ,t 29 - 1 /{2g + l) + ■■■). 

After removing the common factor of t, for t G 2Z 2 each summand on the right is in 
2Z 2 except the initial 1; thus p(£(t)) = (1 : : ■ ■ • : 0) for all nonzero t G 2Z 2 . Hence 
plog(C(Q 2 )) = {(1 : : ■ • • : 0)}. The computation shows also that the only zero of log on 
C(Q 2 ) is oo, so C(Q 2 ) n J(Q 2 )tors = {oo}, so C £ Z. □ 

Theorem 10.3. Fix g > 3. The lower density of the set of curves C G T g satisfying 
C(Q) = C(Q 2 ) n J(Q) = {oo} positive. 

Proof. Proposition 8.7 yields U C ^(Zp) \ Z containing the curve of Lemma 10.2 and 
satisfying the hypothesis of Proposition 8.13. By Proposition 8.13, C(Q) = C(Q 2 ) D J(Q) = 
{oo} for C E J-'g C\ U outside a subset of relative upper density at most 2 • 2 1 " 9 < 1. □ 

Remark 10.4. Eq 2 (3) would imply that the lower density of the set of C G J-" 2 satisfying 
C(Q) = {°°} is positive. This can be proved by a similar argument, using curves 3-adically 
close to y 2 = x 5 — x 3 — 1. 



10.2. Density tending to 1. 
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Theorem 10.5. Fix g > 1. Then the lower density of the set of curves C G T g satisfying 
C(Q) = C(Q 2 ) n J(Q) = {00} at /east 1 - (12g + 20)2~ 9 . 

Proof. Apply Proposition 8.13 to each congruence class in Remark 8.14, and sum the results 
by using Corollary 9.10 for p = 2: the upper density of curves not satisfying the condition 
is at most (1 + (6g + 9)) 2 l s = (12g + 20)2"». □ 

Remark 10.6. We have 1 - (12^ + 20)2^ > if and only if g > 7. Also, 1 - (12^ + 20)2^ 1 
as g 00. 

Using the refinement given in Remark 9.11, the bound can be improved to 1 — (6(7 + 11)2~ 9 , 
which is positive also for g = 6. 

Theorem 10.7. Fix g > 1 and an odd prime p. Assume Eq g (p). Then the lower density of 
the set of curves C G T g satisfying C(Q) = {00} is at least 

1 - (l + (p + I) 2 + J^J&9 ~ 2))p W 

Proof Repeat the proof of Theorem 10.5, using the bound for odd p in Proposition 5.4. □ 

Corollary 10.8. Fix g > 4. Assume that Eq g (p) holds for arbitrarily large primes p. Then 
the set of curves C G T g satisfying C(Q) = {00} has density 1. 

Proof. The lower bound in Theorem 10.7 tends to 1 as p — > 00. □ 

Remark 10.9. For g = 3, the lower bound tends to from below, but using Remark 9.11 to 
cut the subtracted term essentially in half, we would obtain a limit of 1/2. Thus if Eq 3 (p) 
holds for arbitrarily large primes p, then the set of curves C G J-3 satisfying C(Q) = {00} 
has lower density at least 1/2. 

Remark 10.10. Several authors have presented heuristics or conditional proofs that suggest 
that in an algebraic family of curves of genus greater than 1, the density of those having 
rational points other than points that exist generically is 0: see [PV04, Conjecture 2.2], 
[P00O6], [Gra07, Conjecture 1.3(h)], and [Sto09, Conjecture 1]. 

10.3. Effectivity. Let J r | ood be the set of C G T g such that a is injective, the images 
plog(C(Q2)) and P<r(Sel2 J) are disjoint, and C(Q2) contains no nontrivial torsion point of 
odd order. Our proof of Theorem 10.3 (resp., Theorem 10.5) can be summarized as follows: 

• if C G .Ff ood , then Chabauty's method at the prime 2 proves that C(Q) = {00}; 

• J r | ood has positive lower density if g > 3 (resp., density at least 1 — (12g + 20)2~ 9 for 
each g > 1). 

Theorem 10.11. There is an algorithm that takes as input an integer g > 1 and a curve 
C G T g and decides whether or not C G J-"| ood . 

Proof. One such algorithm (not an especially efficient one) proceeds as follows. Let Q' 2 be 
the subfield of Q 2 consisting of elements algebraic over Q. The advantage of Q' 2 over Q 2 
from the algorithmic point of view is that an element of Q' 2 can be specified exactly with a 
finite amount of data. On the other hand, Q' 2 approximates Q 2 well in the sense that C(Q 2 ) 
is dense in C(Q 2 ) and J(Q 2 ) is dense in J(Q 2 ). 
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First, use [PooOl] to compute the finite set C(Q) R J (Q) tors- Check each of its points for 
membership in C(Q2) to compute T := C(Q2) H J(Q2)tors- Since J(Q2)tors C ^(Q' 2 ), we have 
T C C(Q 2 ). If any T G T is of odd order greater than 1, then return "no". 

Next, use [StoOl] to compute the group Sel 2 J, the group J(Q 2 )/2J(Q 2 ), and the map 
between them. Compute J(Q 2 ) t0 rs an d its image in </(Q 2 )/2J(Q 2 ). If the map a: Sel 2 J — > 
^(Q2)/(2J(Q 2 ) + </(Q2)tors) — ^2 ^ s n °t injective, then return "no". Otherwise compute 
Pa(Sel 2 J). 

Finally, we need to compute plog(C(Q 2 )). Choose degree divisors D ly . . . , D g on Cq> 2 
representing an F 2 -basis for J(Q 2 )/(2J(Q 2 ) + </(Q2)tors)- Choose any Q-basis u[, . . . ,oj' for 

H°(C, fi 1 ), say := x^~ l dx/y. We can compute the integrals j Dl a/- G Q2 to any desired 
precision by integrating formal power series and using the group law on J (see [MP10, 
Section 8.3], for example). Compute them to sufficient precision that we can find a new 
basis u>i, . . . ,oo g of H°(C, Vt l ) guaranteed to make the matrix (J Dl ojj) lie in GL 9 (Z 2 ). This 
new basis defines a homomorphism log as in Section 4. 

For each T G T, choose a uniformizer t on C<q> at T. For each sufficiently small value 
of t in Q2, let Pt be the corresponding point of (7(Q 2 ) near T. Each coordinate of P% is a 
power series in Q 2 [[^]] that can be calculated in the sense that any desired coefficient can 
be calculated. The same is true for the power series £i(t) := f^Ui for each i. Each i^t) 
vanishes at t — since T is torsion, but some Ui is nonvanishing at T so the £i(t) do not 
all vanish to order 2. Using Hensel's lemma, we can control the rate of convergence of all 
these power series in order to compute an explicit open and closed neighborhood of in Q 2 
on which ^(i -1 ^!^), . . . ,t~ 1 £ g (t)) converges and is constant. Then plog is constant on the 
corresponding explicit neighborhood of T in C(Q2). Such a neighborhood can be specified 
explicitly as a fiber of the map C((Q>2) = C(Zy — > C(Z/2 6 Z) for some e, where C is an explicit 
proper Z2-model of C, say the Weierstrass model. Let U be the union of these neighborhoods 
as T varies; thus we know plog(£7). 

On C(Q 2 ) \ U, log is bounded away from 0. Therefore, for any P G C(Q 2 ) \ U, the value 
plog(P) can be computed, and the computation examines only finitely many 2-adic digits 
of the coefficients of P; in other words, the computation succeeds with the same result for 
all points in a fiber of C((Q>2) — £(^2) — > C(Z/2 e Z) for some e. Since C(Q2) \ U is compact, 
it can be covered by finitely many such fibers. For e = 1, 2, ... in turn, attempt to calculate 
plog(P) for one point P in each nonempty fiber of C(Q 2 ) \ U — > C(Z/2 e Z) using only the 
precision specified by the image of P in C(Z/2 e Z). We may fail for the first few e, but the 
compactness argument guarantees that eventually a successful e will be found, and then we 

knowplog(C(Q 2 )\^)- 

Taking the union of plog(C7) and plog(C(Q 2 ) \ U) yields plog(C(Q 2 ))- Return "yes" or 
"no" according to whether plog(C(Q2)) and P<r(Sel2 J) are disjoint. □ 

Corollary 10.12. There is an algorithm based on Chabauty's method at the prime 2 that 
succeeds in determining C(Q) for a computable set of curves C G T g of lower density at least 
the bound in Theorem 10.3 or Theorem 10.5 for any g > 3. 
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